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(VIRAGE.030A), filed April 7, 2001 and titled "Video-Enabled Community Building," 

and to U.S. Application No. (VERAGE.029A), filed April 7, 2001 and 

titled "Video-Enabled E-Commerce," which are all hereby incorporated by reference. 

15 Background of the Invention 

Field of the Invention 

The present invention generally relates to the field of accessing and processing 
digital video on a network such as the Internet. More particularly, the invention relates 
to innovative techniques to solve the problem of finding video content on the Internet. 

20 

Description of the Related Technology 

A number of techniques have evolved in recent years as the Internet has grown in 
size and sophistication, including: 

• The use of web servers and HTML delivery to web browsers. 

25 • The use of the application-server model for connecting database information 

with web pages and interactive interfaces for end users. 

• The use of dynamically generated HTML that pulls information from a database 
to dynamically format HTML for delivery to the end user. 

• The use of a template language to merge database output with pre-formatted 
3 0 HTML presentations. 
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• The use of 'cookies' to track individual user preferences as they interact with the 
web pages and applications. 
These and other related web technologies and techniques are in commonplace use and 
readily accessible on the Internet. 
5 In addition to the technologies described above, video indexing technology has 

also emerged, herein referred to as 'video logging'. Video logging is a process that 
incorporates both automated indexing and manual annotation facilities to create a rich, 
fme-grained (in a temporal sense) index into a body of video content. The index 
typically consists of a combination of visual and textual indices that permit time-based 

10 searching of video content. The index may incorporate spoken text, speaker 
identifications, facial identifications, on-screen text, and additional annotations, 
keywords, and descriptions that may be applied by a human user executing the video 
logging application. The Virage VideoLogger® is one example of this type of video 
logging technology that is commercially available. 

15 The delivery of coded media on the Internet requires the encoding of video 

content into one or more coding video formats and efficient delivery of that content to 
the end users. Common coding formats presently in use include RealVideo, Microsoft 
Windows Media, QuickTime, and MPEG. The video logging technology may help to 
orchestrate the encoding of one or more of these formats while the video is being 

20 indexed to ensure that the video index is time-synchronized with the encoded content. 

The final delivery of coded media content to an end user is typically accomplished with 
a wide variety of video serving mechanisms and infrastructure. These mechanisms may 
include basic video servers (such as those from Real, Microsoft, and Apple), caching 
appliances (such as those from CacheFlow, Network Appliance, Inktomi, and Cisco), 

25 and content delivery networks (herein "CDN's", such as those from Akamai, Digital 

Island, iBeam, and Adero). These types of video serving mechanisms deliver media 
content to the end user. 

Coded media such as video, Flash™, SMIL, and similar formats (collectively 
referred to as 'video') is available on the World Wide Web in large quantities. Video 

30 content is available 'on demand' from archives, and is 'webcast' in a live manner 
similar to broadcasts. While there some efforts to provide a "TV Guide" for the live 
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webcasted video (such as Yack and ChannelSeek), there are unfortunately very few 
indexes of archived video content. The only ones that exist are highly localized (they 
only index one site). End users have no central search and access mechanism like those 
that exist for web-based text content using traditional search engines. Moreover, the 
5 content is rapidly changing and growing, and this makes it impossible for individuals 
remain abreast of the content available at any given time. 

What would be desired is the ability to automatically discover and index video content 
existing on web pages. This discovery and indexing process is called 'web crawling' or 
'spidering'. The fundamental concept of spidering is to traverse a set of hyperlinked 

10 documents (web pages) by following the hyperlinks from one page to the next. 

Existing spidering technologies are intended to generate an index of the text content 
found on the pages by parsing the HTML. However, web pages contain many more 
forms of content other than text. They also contain rich media such as images, video, 
and animated graphics (i.e., SMIL, Flash or Shockwave presentations). These types of 

15 content are embedded in HTML statements or sophisticated blocks of scripting 

language (such as JavaScript or VBscript). Existing spiders identify these types of 
content and skip over them. It would be advantageous to locate and identify rich 
content in order to index it. 

Identifying a video URL for indexing may be fairly easy in some cases if the 

20 video content is a simple file linked in a basic HTML "HREF" statement. However, 
most video content is exposed on web pages in a more complex manner using scripting 
languages and meta-container files (like "asx" and ".ram") to make the presentation of 
the video interactive, to specify a play-list of individual videos, or to offer multiple 
choices of bit-rates or formats. Thus, the URL for the content is not explicit, but must 

25 be evaluated by executing the scripting language or parsing the container file in a 
similar way as would a web browser application. Even then, it is necessary to identify 
the multiple versions of a piece of content so that it is only indexed one time. Thus, it 
would be desirable to parse out blocks of script and execute it, and also to use the 
context of the script, video URLs, and surrounding HTML to group versions (varying 

30 by bit-rate and/or coding format) of the same content together. 
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Summary of Certain Inventive Aspects 
The present system and method solves the problem of 'how an individual finds 
video content' by creating a centralized index repository, constantly updated, that can be 
used by individuals to perform searches. The search index is a hosted web application 
called the Internet video guide (IVG) that can be integrated into other websites in a 
variety of ways and also solves a significant interface problem. Since there is 
(probably) no "one right way" to present the spectrum of web video content in a search 
interface, the IVG is designed to be used by existing web search sites (websites like 
Yahoo, Excite, Earthlink, Altavista, etc, called 'portals') to expose various custom- 
tailored interfaces to the end user. It could be a category-browser interface as in Yahoo, 
or a keyword-search interface as in Altavista, or a combination of both. Also, small 
websites with a narrow interest in a particular content area can only expose links or 
search interfaces that exploit a small part of the index. As a hosted web application, it 
gains the efficiencies of a single repository, but the interface flexibility of multiple 
websites. 

Aspects of the system and method locate and identify rich content in order to 
index it. The spider is designed to operate on video or animation (hereafter 'video') 
content in the form of coded video or files, and uses video logging technology to 
generate an index of the video. The index is committed to the IVG repository along 
with a location identifier, such as an Uniform Resource Locator (URL), of the video 
being indexed. 

The URL for the content is not explicit, but is evaluated by executing the 
scripting language or parsing the container file in a similar way as would a web browser 
application. Even then, it is necessary to identify the multiple versions of a piece of 
content so that it is only indexed one time. The present system and method can parse 
out blocks of script and execute it, and also to use the context of the script, video URLs, 
and surrounding HTML to group versions (varying by bit-rate and/or coding format) of 
the same content together. 

A by-product of parsing scripts and dynamic URL construction is that the video 
URL contains not only access to the video content itself, but often also contains the 
mechanism to launch the video for playback in the player window of the site containing 



the video. Player windows are typically highly customized and branded for the site, and 
typically contain navigation elements, branding elements (like logos), and advertising 
elements. When the video index repository is searched for video content, the 
corresponding URL of the video can be used to invoke the specific player window of 
the website containing the video. This capability is far more compelling and 
informative to the user in comparison to just accessing raw video out of context. This 
capability also importantly avoids any rights management issues that arise due to 'deep 
linking', i.e., directly accessing the content and hiding its origin. 

The result of the spidering process is a collection of video URLs that are passed 
(through a queue) to a video logging process. Each URL is accessed, and the video 
content is downloaded or transmitted to the video logging process for indexing. The 
index data is then committed to the main repository of the Internet Video Guide search 
application. 

The present system and method includes a set of 'maintenance' features that the 
spider employs as it is spidering. The maintenance features are similar to existing text- 
locating spiders currently deployed by popular web search sites, and are intended to 
minimize re-indexing of content. The World Wide Web is a dynamic place, and content 
on web pages is changing frequently. But not all of it is changing. The spider uses date 
information to see if a URL needs updating, and it looks for new URLs on pages it has 
previously indexed. It also keeps track of missing URLs (content that was removed), 
and performs integrity checking so as to insure the URL links to a video that still exists. 

The present system and method utilizes techniques where an index of video 
content dispersed across the World Wide Web is generated and stored in a central 
repository called the Internet video guide (IVG). In one embodiment, the IVG is a 
hosted web-based application incorporating innovative mechanisms for collecting the 
index of video content. The system is discussed in two parts: the IVG application 
itself, and an innovative video spidering mechanism that enables the collection and 
indexing of video content on the Web which makes the IVG possible and useful. 

The IVG (also hereinafter referred to as the "Guide") is a hosted application that 
is provided as a service to web portals and other websites that wish to expose access to 
the broad array of video content available on the Internet, either as a whole, or as 



selected subsets (such as medical content, or science content, etc.). The operation of the 
Guide is the conjunction of the mechanism for search application hosting and the 
processes of content gathering. 

The video spidering mechanism of the Guide is an important innovation that 
allows the viable realization of the Guide. Just as text-based Internet spiders enabled 
the existence of traditional web-search services and engines, the video spider enables 
the existence of the IVG. The video spidering technology is similar to existing spiders 
for crawling text-based web pages (i.e., HTML documents), but includes novel new 
aspects. The spider operates on video content in the form of coded video or files, and 
uses video logging technology to generate an index of the video, incorporating both 
automated processing and an option for manual, editorial processes. The index is 
committed to the Guide repository along with the URL information of the video being 
indexed. 

In one aspect of the present invention, there is a system of indexing and 
searching video, comprising a video index generated through a logging mechanism that 
associates the video index with a location identifier of the video, a search engine 
operating on the index, a web server and application logic to perform searches against 
the index and deliver search results to browsers, and a template mechanism configured 
to inject the video into templates at a search site. The system may further comprise a 
spidering module configured to automatically locate and index video content on a 
network. The video index may be additionally generated by human annotation. 

In another aspect of the present invention, there is a method of video directory 
formation, comprising capturing an aggregation of index data from existing sources 
having previously indexed videos, and capturing and indexing videos transmitted using 
a video distribution mechanism. The video distribution mechanism may include 
satellite, cable, and airwaves. 

In another aspect of the present invention, there is a method of video directory 
formation, comprising using a spidering process to gather and maintain video content 
located on a network, and capturing and indexing videos transmitted using a video 
distribution mechanism. The video distribution mechanism may include satellite, cable, 
and airwaves. 



In another aspect of the present invention, there is a method of video directory 
formation, comprising using a spidering process to gather and maintain video content 
located on a network, and capturing an aggregation of index data from existing sources 
having previously indexed videos. 

In another aspect of the present invention, there is a system for sharing indexed 
video, comprising a spider module configured to gather video content from a network, 
and a hosting service in data communication with the spider module, wherein the 
hosting service is configured to share searchable video for customized viewing at 
customer sites. The system may further comprise at least one website configured to 
integrate the video shared by the hosting service into the website using at least one 
search and retrieval metaphor, where the website may comprise a web portal. The 
system may further comprise a search web page having a search form that includes one 
or more fields used to express a query. The hosting service may include a logging 
facility configured to generate an index of the gathered video content, and may further 
comprise a browse web page having category links arranged in a subject hierarchy, with 
leaf nodes of the hierarchy performing pre-defined searches against the index. 

In another aspect of the present invention, there is a method of video spidering, 
comprising traversing a set of hyperlinked documents by following the hyperlinks from 
one page to the next so as to identify digital video, generating a time-based index of the 
video, and storing the index in a repository along with a hyperlinked location identifier 
associated with the video being indexed. The method may further comprise identifying 
multiple versions of a video so that it is only indexed one time. The method may 
further comprise parsing out blocks of script associated with the video, and executing 
the parsed blocks of script so as to identify one or more location identifiers 
corresponding to video segments. The method may further comprise grouping 
differently coded versions of the video together. The method may further comprise 
searching for video content, wherein a corresponding location identifier of the video 
may be used to invoke a specific coded video player of a site containing the video. 

In another aspect of the present invention, there is a method of video spidering, 
comprising traversing a network of linked content including at least one video, 
collecting location identifiers where the video resides on the network, and generating 



time-based metadata through access to the video via the collected video location 
identifiers. 

In another aspect of the present invention, there is a method of video spidering, 
comprising spidering a network of linked content so as to locate at least one video, 
indexing the located video into a video index, and performing maintenance operations 
on the located video. The maintenance operations may include using date information 
to either: (1) reindex a previously located video or (2) index a newly posted video. The 
maintenance operations may include identifying previously indexed video which is 
missing from the video index. The maintenance operations may include making 
integrity checks on the located video. 

In yet another aspect of the present invention, there is a method of video 
spidering, comprising dynamically identifying at least one video on a network, 
accessing content corresponding to the identified video, parsing a script associated with 
the identified video, and launching the identified video for playback on a visual display 
according to the parsed script. 

Brief Description of the Drawings 
The above and other aspects, features and advantages of the invention will be 
better understood by referring to the following detailed description, which should be 
read in conjunction with the accompanying drawings. These drawings and the 
associated description are provided to illustrate various embodiments of the invention, 
and not to limit the scope of the invention. 

Figure 1 illustrates a typical network configuration in which this invention may 
operate. 

Figure 2 is a block diagram of a system architecture overview in accordance 
with one embodiment of the invention. 

Figure 3 is a block diagram showing a high-level view of a set of components of 
the video guide embodiment and a typical structure of a web portal that is a consumer 
of this service in accordance with another embodiment of the invention. 



Figure 4 is a block diagram showing the major modules of the video spidering 
subsystem shown in Figure 2 and Figure 3. 

Figure 5 is a flowchart showing the process of end-user search, browse, and 
retrieval of selected video content found in the guide's repository as performed on the 
architecture embodiments shown in Figures 2 and 3. 

Figure 6 is a flowchart showing the overall process of video spidering and video 
index maintenance as performed on the architecture embodiments shown in Figures 2 
and 3. 

Figure 7 is a flowchart detailing the spidering process shown in Figure 6. 
Figure 8 is a flowchart detailing the uniqueness check process shown in 
Figure 6. 

Figure 9 is a flowchart detailing the video grouping process shown in Figure 6. 
Figure 10 is a flowchart detailing the video harvesting process shown in 
Figure 6. 

Figure 1 1 is a flowchart detailing the video index maintenance process shown in 
Figure 6. 

Detailed Description of Certain Embodiments 
The following detailed description of certain embodiments presents various 
descriptions of specific embodiments of the present invention. However, the present 
invention can be embodied in a multitude of different ways as defined and covered by the 
claims. In this description, reference is made to the drawings wherein like parts are 
designated with like numerals throughout. 

Definitions 

The following provides a number of useful possible definitions of terms used in 
describing certain embodiments of the disclosed invention. 

A network may refer to a network or combination of networks spanning any 
geographical area, such as a local area network, wide area network, regional network, 
national network, and/or global network. The Internet is an example of a current global 
computer network. Those terms may refer to hardwire networks, wireless networks, or 



a combination of hardwire and wireless networks. Hardwire networks may include, for 
example, fiber optic lines, cable lines, ISDN lines, copper lines, etc. Wireless networks 
may include, for example, cellular systems, personal communication services (PCS) 
systems, satellite communication systems, packet radio systems, and mobile broadband 
5 systems. A cellular system may use, for example, code division multiple access 

(CDMA), time division multiple access (TDMA), personal digital phone (PDC), Global 
System Mobile (GSM), or frequency division multiple access (FDMA), among others. 

A website may refer to one or more interrelated web page files and other files 
and programs on one or more web servers, the files and programs being accessible over 

10 a computer network, such as the Internet, by sending a hypertext transfer protocol 
(HTTP) request specifying a uniform resource locator (URL) that identifies the location 
of one of said web page files, wherein the files and programs are owned, managed or 
authorized by a single business entity. Such files and programs can include, for 
example, hypertext markup language (HTML) files, common gateway interface (CGI) 

15 files, and Java applications. The web page files preferably include a home page file that 

corresponds to a home page of the website. The home page can serve as a gateway or 
access point to the remaining files and programs contained within the website. In one 
embodiment, all of the files and programs are located under, and accessible within, the 
same network domain as the home page file. Alternatively, the files and programs can 

20 be located and accessible through several different network domains. 

A web page or electronic page may comprise that which is presented by a 
standard web browser in response to an HTTP request specifying the URL by which the 
web page file is identified. A web page can include, for example, text, images, sound, 
video, and animation. 

25 Content, media content, coded (e.g., encoded or transcoded) media content and 

streaming media content may refer to the delivery of electronic materials such as music, 
videos, software, books, multimedia presentations, images, and other electronic data, for 
example over a network to one or more users. Content data will typically be in the form 
of computer files for video, audio, program, data and other multimedia type content. 

30 However, content data may additionally be in the form of actual physical copies of 

valuable content, for example CD-ROM, DVD, VCR, audio, TV or radio broadcast 
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signals, coded audio and video over networks, or other forms of conveying such 
information. 

A computer or computing device may be any processor controlled device that 
permits access to the Internet, including terminal devices, such as personal computers, 
5 workstations, servers, clients, mini-computers, main-frame computers, laptop 
computers, a network of individual computers, mobile computers, palm-top computers, 
hand-held computers, set top boxes for a television, other types of web-enabled 
televisions, interactive kiosks, personal digital assistants, interactive or web-enabled 
wireless communications devices, mobile web browsers, or a combination thereof. The 

10 computers may further possess one or more input devices such as a keyboard, mouse, 
touch pad, joystick, pen-input-pad, and the like. The computers may also possess an 
output device, such as a visual display and an audio output. One or more of these 
computing devices may form a computing environment. 

These computers may be uni-processor or multi-processor machines. 

15 Additionally, these computers may include an addressable storage medium or computer 
accessible medium, such as random access memory (RAM), an electronically erasable 
programmable read-only memory (EEPROM), programmable read-only memory 
(PROM), erasable programmable read-only memory (EPROM), hard disks, floppy 
disks, laser disk players, digital video devices, compact disks, video tapes, audio tapes, 

20 magnetic recording tracks, electronic networks, and other techniques to transmit or store 

electronic content such as, by way of example, programs and data. In one embodiment, 
the computers are equipped with a network communication device such as a network 
interface card, a modem, or other network connection device suitable for connecting to 
the communication network. Furthermore, the computers execute an appropriate 

25 operating system such as Linux, Unix, a version of Microsoft Windows, Apple MacOS, 
IBM OS/2, or other operating system. The appropriate operating system may include a 
communications protocol implementation that handles all incoming and outgoing 
message traffic passed over the Internet. In other embodiments, while the operating 
system may differ depending on the type of computer, the operating system will 

30 continue to provide the appropriate communications protocols to establish 
communication links with the Internet. 
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The computers may contain program logic, or other substrate configuration 
representing data and instructions, which cause the computer to operate in a specific and 
predefined manner, as described herein. In one embodiment, the program logic may be 
implemented as one or more object frameworks or modules. These modules may be 
5 configured to reside on the addressable storage medium and configured to execute on one 
or more processors. The modules include, but are not limited to, software or hardware 
components that perform certain tasks. Thus, a module may include, by way of example, 
components, such as, software components, object-oriented software components, class 
components and task components, processes, functions, attributes, procedures, 

10 subroutines, segments of program code, drivers, firmware, microcode, circuitry, data, 

databases, data structures, tables, arrays, and variables. 

The various components of the system may communicate with each other and 
other components comprising the respective computers through mechanisms such as, by 
way of example, interprocess communication, remote procedure call, distributed object 

15 interfaces, and other various program interfaces. Furthermore, the functionality 
provided for in the components, modules, and databases may be combined into fewer 
components, modules, or databases or further separated into additional components, 
modules, or databases. Additionally, the components, modules, and databases may be 
implemented to execute on one or more computers. In another embodiment, some of 

20 the components, modules, and databases may be implemented to execute on one or 
more computers external to the website. In this instance, the website includes program 
logic, which enables the website to communicate with the externally implemented 
components, modules, and databases to perform the functions as disclosed herein. 

Content may be prrovided to the video guide facility for processing via many 

25 media sources, including, but not limited to, tape, cable, satellite, or digital files. The 

content may be encoded or transcoded into various coded video formats, for example, 
Real, Windows Media, or QuickTime, and indexed. Indexing may be performed using 
a video logging application, such as the Virage VideoLogger, that analyzes the video 
signal to extract metadata. Metadata is not the video data itself, but instead is data that 

30 is derived by processing performed on the video, audio, or closed caption inputs using 
advanced media analysis algorithms. Human operators may add additional editorial 
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information through a process known in the art as 'clip marking', 
of the visual, audio, and textual elements of the video content, 
corresponding coded video. 



The result is an index 
synchronized with the 



5 Overview of the Internet Video Guide Application 

The Internet (although Internet is used in the name, any computing environment 
or global computer network is part of the invention) video guide (IVG or the "Guide") 
is a hosted application that is provided as a service to web portals and other websites 
that wish to expose access to the broad array of video content available on the Internet, 
10 either as a whole, or as selected subsets (such as medical content, or science content, 
etc.). The operation of the Guide is the conjunction of the mechanism for search 
application hosting and the processes of content gathering. The Guide centralizes video 
index information available from three main sources: 

• An aggregated index of video content derived from individual content owners 
15 who wish their video to be made available through the Guide. These content 

owners might be existing service customers of the Virage video application 
hosting business model ("Guide Affiliates", see Applicant's copending U.S. 

Patent Application No. , filed April 7, 2001, entitled Interactive 

Video Application Hosting, which is hereby incorporated by reference), or they 
20 might be random content owners who submit (and possibly pay a fee to submit) 

their content to the Guide. 

• A proactively gathered index of video content collected by the video spidering 
mechanism to harvest video content from the World Wide Web in general. The 
spidering mechanism is discussed in detail below, and greatly adds to the 

25 efficacy of the Guide since this represents the largest source of video content in 

the Guide's repository. 

• A proactively gathered index of video content collected by capturing and 
indexing public domain content from tape, satellite, cable, or airwaves. These 
source signals are additionally encoded into coded formats and made available 

30 on the Internet by a content distribution network. 
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These bodies of source content allow the Guide to collect and maintain a video 
index in the form of a hosted, centralized repository, and provide an application 
interface that can be made available to web portals and other websites. The central 
repository consists of a fine-grained, time-stamped video index generated through video 
logging processes which can also incorporate human editorial annotations and 
descriptions, and which associates the video index data with the source video via URLs. 
The Guide's application features rely on a web server and application logic to perform 
searches against the video index to deliver search results to the web browser of the end 
user. In one embodiment, searches are conducted using standard text-search technology 
operating on the video index, such as the engines available from Altavista, Verity, and 
so on. The presentation of search results employs commonly used template 
mechanisms to dynamically generate HTML presentations that are distinct from the 
video index data itself, and can be highly customized for each website that is a customer 
of the service. Finally, the Guide also provides standard administration and reporting 
mechanisms that govern the operation, maintenance, and usage statistics of the Guide. 

The separation of the video index from the presentation mechanism implies that 
websites and web portals can integrate the Guide application into their website using a 
variety of search and retrieval metaphors within their unique user interfaces. A search 
web page can be built using a search form consisting of one or more fields used to 
express the query, possibly with options for Boolean operators among the fields. A 
field could be a simple free-text or keyword entry field, or it could be a pull-down list of 
pre-defined selections, or it could be a date constraint. Alternatively, a browse web 
page could be built with category links arranged in a subject hierarchy, with leaf nodes 
of the hierarchy performing pre-defined searches against the index. A combination of 
these approaches is also possible, i.e., a fielded search within a selected category. 
Lastly, a website wishing to only expose a domain-specific subset of the entire video 
index can utilize a search form that has hidden field constraints built into the query. For 
example, the search form can constrain the search to only include video in the category 
"Science", while the end-user enters a free-form keyword search into a standard search 
field. The use of this technique is a straight-forward application of HTML forms in 
conjunction with a scripting mechanism such as Javascript. 
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Overview of the Video Spidering Mechanism 

The video spidering mechanism of the Guide is an important innovation that 
allows the viable realization of the Guide. Just as text-based Internet spiders enabled 
the existance of traditional web-search services and engines, the video spider enables 
the existance of the Internet video guide. 

The video spidering technology is similar to existing spiders for crawling text- 
based web pages (i.e., HTML documents), but includes novel new aspects. The spider 
operates on video content in the form of coded video or files, and uses video logging 
technology such as disclosed in Applicant's copending U.S. Patent Application No. 
09/134,497, entitled "Video Cataloger System With Synchronized Encoders", which is 
hereby incorporated by reference, to generate an index of the video, incorporating both 
automated processing and the option for manual, editorial processes. The index is 
committed to the Guide repository along with the URL information of the video being 
indexed. 

The video spider consists of several distinct modules that collectively implement 
the ability to index Internet video content. The first module is an HTML parsing kernel 
that can parse web pages and follow hyperlinks by emulating the behavior and 
capabilities of a standard web browser. This is similar to the traversal mechanisms of 
traditional, text-based spiders. The second module includes an input queue that can 
accept starting point URLs to begin traversals. The starting point URLs are typically 
entered via an administrative process whereby a human editor can direct the spider to 
prominent, well-known, and high quality content containing video deemed to be of 
interest to a large user population. The third module includes one or more scripting 
language parsers and interpreters to identify and execute blocks of embedded script in 
the pages (such as Javascript, Vbscript, etc.) to evaluate video URLs that are not 
explicit, simple links to video content. The fourth module includes parsing logic for 
container files (containing play-lists) such as .ASX (Microsoft) or .RAM (Real 
Networks) files. The fifth module contains logic for associating and grouping different 
versions of like content. Often, the same video content is available on a web page in a 
variety of coded video formats and bit-rates to accommodate the needs and preferences 
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of end-users. The grouping module is responsible for identifying these multiple 
versions and concluding whether or not they represent the same baseline content. In one 
embodiment, only one version of the content needs to be harvested and indexed for 
search and retrieval purposes. During indexing, all versions of the like content are 
5 associated with the index. The sixth module includes maintenance logic to minimize 
the need to re-index content and verify the continued existance and availability of 
previously indexed content (i.e., "cleaning up dead links"). The seventh module 
includes a harvesting mechanism that places found and unique video URLs into a queue 
for processing by the video logging process. Finally, an automated logging mechanism 

10 processes the harvest-queue of URLs and ingests the video content using the video 

logging process to generate a video index associated with each content URL. 

Collectively, these modules are organized in a processing system that is 
governed by a control module to make the spidering, grouping, maintenance, and 
harvesting system operate. The processes implemented by the above modules generally 

15 operate with a significant degree of concurrancy. For example, maintenance and 
harvesting processes are largely independent and proceed in parallel. The control 
system manages the operation of these processes, I/O queues, and the submission of the 
resulting video indices or changes to the indices to the Guide's central repository. 

20 Description of the Figures 

Figure 1 illustrates a typical network configuration 100 in which this invention 
may operate. However, various other types of electronic devices communicating in a 
networked environment may also be used. An end user 102 communicates with a 
computing environment, which may include multiple server computers 108 or a single 

25 server computer 110 in a client/server relationship on a network communication 
medium 116. In a typical client/server environment, each of the server computers 108, 
110 may include a server program that communicates with a user device 115, which 
may be a personal computer (PC), a hand-held electronic device (such as a PDA), a 
mobile or cellular wireless phone, a TV set, or any number of other electronic devices, 

30 The server computers 108, 110, and the user device 115 may each have any 

conventional general purpose single- or multi-chip microprocessor, for example a 
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Pentium processor, a Pentium Pro processor, a MIPS processor, a Power PC processor, 
an ALPHA processor, or other general purpose processors. In addition, the 
microprocessor may be any conventional special purpose microprocessor such as a 
digital signal processor or a graphics processor. Additionally, the server computers 108, 
5 110 and the user device 115 may be desktop, server, portable, hand-held, set-top, or other 

desired type of computing device. Furthermore, the server computers 108, 110 and the 
user device 115 each may be used in connection with various operating systems, 
including, for example, UNIX, LINUX, Disk Operating System (DOS), VxWorks, 
PalmOS, OS/2, Mac OS, a version of Microsoft Windows, or other operating system. 

10 The server computers 108, 110 and the user device 115 may each include a 

network terminal equipped with a video display, keyboard and pointing device. In one 
embodiment of the network configuration 100, the user device 115 includes a network 
browser 120 used to access the server computers 108, 110. The network browser 120 
may be, for example, Microsoft Internet Explorer or Netscape Navigator. The user 102 

15 at the user device 115 may utilize the browser 120 to remotely access the server 
program using a keyboard and/or pointing device and a visual display, such as a monitor 
118. Although Figure 1 shows only one user device 1 15, the network configuration 100 
may include any number of client devices. 

The network 116 may be any type of electronic transmission medium, for 

20 example, including but not limited to the following networks: a virtual private network, 

a public Internet, a private Internet, a secure Internet, a private network, a public network, 
a value-added network, an intranet, or a wireless gateway. The term "virtual private 
network'* refers to a secure and encrypted communications link between nodes on the 
Internet, a Wide Area Network (WAN), Intranet, or any other network transmission 

25 means. 

In addition, the connectivity to the network 116 may be via, for example, a 
modem, Ethernet (IEEE 802.3), Token Ring (IEEE 802.5), Fiber Distributed Datalink 
Interface (FDDI), Asynchronous Transfer Mode (ATM), Wireless Application Protocol 
(WAP), or other form of network connectivity. The user device 115 may connect to the 
30 network 1 16 by use of a modem or by use of a network interface card that resides in the 
user device 115. The server computers 108 may be connected via a wide area network 
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106 to a network gateway 104, which provides access to the wide area network 106 via 
a high-speed, dedicated data circuit. 

As would be well known to one skilled in the art, devices other than the 
hardware configurations described above may be used to communicate with the server 
5 computers 108, 110. If the server computers 108, 110 are equipped with voice 
recognition or Dual Tone Multi-Frequency (DTMF) hardware, the user 102 may 
communicate with the server computers by use of a telephone 124. The telephone may 
be optionally equipped with a browser 120 and display screen. Other examples of 
connection devices for communicating with the server computers 108, 110 include a 

10 portable personal computer (PC) 126 or a personal digital assistant (PDA) device with a 

modem or wireless connection interface, a cable interface device 128 connected to a 
visual display 130, or a satellite dish 132 connected to a satellite receiver 134 and a 
television 136. Still other methods of allowing communication between the user 102 
and the server computers 108, 1 10 are contemplated by this application. 

15 Additionally, the server computers 108, 110 and the user device 115 may be 

located in different rooms, buildings or complexes. Moreover, the server computers 
108, 110 and the user device 115 could be located in different geographical locations, 
for example in different cities, states or countries. This geographic flexibility which 
networked communications allows is additionally within the contemplation of this 

20 application. 

Figure 2 is a block diagram of a system architecture 200 overview in accordance 
with one embodiment of the invention. In this embodiment, the system architecture 200 
includes a video guide facility 210, which includes a video processing module 214 for 
encoding and indexing public and affiliate video content 212. Although the term 

25 facility is used, components do not necessarily need to be at a common location. The 

media content 212 may be transferred from any device connected to the network 1 16 as 
shown in Figure 1, or may be transferred by other means as a live feed or recorded on a 
physical tape. The video guide facility 210 further includes a hosted video application 
module 216, which receives index data from the video processing module 214. The 

30 hosted video application module 216 communicates with a web portal facility 220 
having a portal website 222. The hosted video application module 216 additionally 
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communicates with a content distribution network 240 for uploading coded video. The 
video guide facility 210 further includes a spider operations module 218 in 
communication with the video processing module 214. 

The portal website 222 communicates with the hosted video application 216 for 
transferring video search requests and receiving search results data. The system 
architecture 200 further includes a communications network 116, such as the Internet. 
The portal website 222 communicates with the content distribution network 240 via the 
Internet 116. The content distribution network 240 is part of a wide variety of video 
serving mechanisms and infrastructure that serve to deliver coded media content 242 to 
the end user 102 via the user device 115. The spider operations module 218 spiders 217 
the Internet 116 and receives relevant content 219 from the Internet 116. In one 
embodiment, the relevant content 219 obtained by the spider operations module 218 
from the Internet 116 is sent to the video processing module 214 for logging and 
indexing. In another embodiment, the spider operations module 218 can include its own 
logging module to generate a video index, in which case this video index is provided to 
the hosted video application 216 by the spider operations module 218. 

The following paragraphs provide a description of the operation of an 
embodiment of the system architecture 200 of Figure 2. A web portal operates a 
website 222, either hosted internally on a portal web server 350 (Figure 3) or outsourced 
to a web-hosting service provider, which delivers their branded interface to end users 
102. The spider operations module 218 provides digital content 219 and the public 
and/or video guide affiliates provide raw media content 212 to the video guide facility 
210 for video indexing and encoding by the video processing module 214. Operation of 
the spider operations module 218 is described hereinbelow. Media content 212 may be 
provided or delivered as analog video tape in any format, as a broadcast, cable, or 
satellite feed, or as digitized video in any format delivered via network communications, 
for example via file transfer protocol ("FTP"). Regardless of its original form, the 
content 212 is processed by the video processing module 214 to encode the content and 
extract index data. The index data may include, for example, keyframes, closed-caption 
text, speaker identifications, facial identifications, or other index data. The content 212 
may additionally undergo an editorial process whereby humans label the video by 
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providing, for example, annotations, descriptions, or keywords. The index and 
annotation information, herein referred to as metadata, is maintained by the hosted 
video application 216, while the coded video is uploaded to the content distribution 
network 240. 

5 In one embodiment, content 212 that is originally in analog form is encoded into 

a digital format in such a way that the time synchronization between the metadata and 
the encoded video is accurate, as is described in U.S. Application No. 09/134,497, 
entitled "Video Cataloger System With Synchronized Encoders". Content 212 that is 
originally in digital form, typically a high-resolution format, is transcoded into an 

10 appropriate format for transmitting. Typically, video content 212 is coded in multiple 
formats (for example RealVideo, Microsoft Windows Media, QuickTime, or MPEG) 
and bit rates (for example modem speed or broadband speed) to offer end users 102 a 
choice of presentation, often depending on individual preferences or Internet connection 
bandwidth. The resulting digital video files from either encoding or transcoding are 

15 uploaded to the content distribution network 240, which delivers the actual coded video 

for display to the end user 102. Once the end user 102 selects an actual segment of 
video content 224 to view, the appropriate video is transmitted from the content 
distribution network 240 to the end user's browser for display via the Internet 116. 
Operation of the spider operations module will be described hereinbelow. 

20 Figure 3 is a block diagram of an architecture 300 of the components of the 

spider operations and IVG in accordance with another embodiment of the system. The 
embodiment of Figure 3 is similar to that in Figure 2, but is depicted in greater detail. 
The architecture 300 shows a high-level view of the IVG, it's major components, and a 
typical structure of a web portal that is a consumer of this service. A main video index, 

25 which includes an IVG video index 334 and an Affililiate video index 332, with time- 
stamped metadata and video URL references is managed by a central Guide server (not 
shown) which hosts an IVG search application 330 on behalf of a portal website 222 (on 
the portal web server 350). In one embodiment, the IVG video index 334 and the 
Affililiate video index 332 can be combined in a single database management system 

30 (dbms) index. 
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Video content is logged and indexed at logging facilities 310 at a service center 
that receives input from two main sources: content identified and processed by the video 
spider operations module 218, and Affiliate and Public content 212 usually (but not 
exclusively) from tape sources. The logging facilities 310 may also include optional 
5 human editing to add annotations to the video index. The video spider operations 
module 218, the IVG application 330 and the logging facilities 310 may all be co- 
located at a single service center, or may be located at separate centers. Content found 
by the spider already exists on the World Wide Web on the Internet 116. This means 
the actual video bits of the content are already available for transmitting (i.e., is coded) 

10 and the video index points to the location of the video bits via a video URL. Affiliate 
and Public content 212 which is processed by the logging facilities 310 is additionally 
encoded and uploaded 312 to a video server 320 or the content distribution network 240 
(Figure 2) to be made available on the Web as coded video. The content distribution 
network 240 can include a caching network such as available from Akamai, Digital 

15 Island, Real Broadcast Network, etc. 

The portal website 222 (the customer of the IVG service) uses the dynamic 
HTML publishing capabilities underlying the server hosting the Guide to present a 
search interface or form 352, a results interface 354, and a video playback interface 356 
to their end users. The video playback interface 356 typically is part of a Web browser 

20 and includes a video player 360. A set of templates 351, such as described in 

Applicant's copending U.S. Patent Application No. , filed April 7, 2001, 

entitled Interactive Video Application Hosting, enables HTML rendering of the search 
forms 352, results data 354 presentation, and video playback 356. Such templates 351 
dictate the graphical look-and-feel of the media presentation to a user of the system. A 

25 set of Guide administration functions 336 enable the web portal customer to maintain 
and modify their template interface to the Guide. The administration functions 336 also 
allow the Guide's own administrators to manage the index and spidering operations. 

Figure 4 is a block diagram showing an architecture 400 of the major modules of 
the video spidering system 218 previously shown in Figures 2 and 3. Each of these 

30 modules represents an independent computer process that typically executes with a large 
degree of concurrency. In one embodiment, the master control of all spidering, 
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harvesting, and maintenance operations is the Control module 410. This module 410 
exposes administrative interfaces 412 for scheduling processing and providing the 
starting 'seed' URLs for the spider to begin traversal. Starting point URLs are managed 
in a queue and passed as jobs to a Video Spider module 420. Control module 410 also 
allows the administrator to balance processing time between a gathering (spidering) 
process on module 420, via a spider management interface 414, a harvesting process on 
a Harvester module 460, via a harvest management interface 416, and a Maintenance 
module 440 for regular, scheduled maintenance operations via a maintenance 
management interface 418. The Control module 410 also interfaces with a Queue 
Management module 430 that communicates with the Video Spider module 420 and the 
Harvester module 460. 

The Video Spider module 420 further includes an HTML parsing kernel 422, a 
script parsing module 424, a uniqueness check logic 426 and a grouping logic 428, 
which will be further described in conjunction with Figure 6 below. The Video Spider 
module 420 communicates with the Queue Management module 430 for storage of data. 
The Queue Management module 430 includes a video URL enqueue function 432 and a 
video URL dequeue function 434 as will be described below. 

The Queue Management module 430 further communicates with and provides 
data to the Harvester module 460. The Harvester module 460 includes a remote control 
of video logging function 462, a metadata storage into search index function 464 and a 
video URL storage into 'known URL' database function 466 as will be described below. 
The Harvester module 460 further communicates with a video logging module 450, 
such as the Virage VideoLogger which may be located at the logging facilities 310 
(Figure 3), and with the Maintenance module 440. The Maintenance module 440 
includes a URL existence checker 442, a URL integrity checker 444, and a modified 
date checker 446 as will be described below. 

Figure 5 is a flowchart showing a process 500 of end-user search, browse, and 
retrieval of selected video content found using the Guide's repository of video metadata 
(in the affiliate video index 332 and the IVG video index 334) and the Guide application 
server. The end-user process 500 of interacting with the Guide 330 (through the user's 
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favorite web portal, such as portal website 222) will be described along with with the 
end-user value of the underlying innovations in the Guide. 

The process 500 begins at start state 502 and moves to state 504 where the end- 
user 102 (Figure 1) visits a search page found on the web portal 222 (customer of the 
Guide). The search page 352 is dynamically composed of text, graphic, and navigation 
elements from the web portal's own web server, such portal web server 350 (Figure 3), 
and a search form emitted by the Guide server at state 506. Alternatively, a category 
browser page, or a combination page including a category browser and a search form 
may be utilized. Proceeding to state 508, the end-user 102 then expresses a query or 
search request in the search form, such as by entering keywords, making pull-down 
selections, and so forth, and submits the query. 

Advancing to state 510, the Guide server then performs the query against the 
video index, and returns results. The query results are dynamically formatted into the 
template-based (351) results presentation screen 354 used by the web portal 222. From 
the results presentation screen 354, the end user 102 can browse the video results by 
inspecting any/all of the keyframes, titles, descriptions, a transcript, and other available 
metadata for the found asset at state 512. The exact presentation is governed by the 
templates (351) used by the web portal 222. Moving to state 514, once the user 102 
selects a specific video for playback, the Guide launches a video player window, such as 
an HTML playback window 356, and invokes the coded video identified by the video 
reference URL. This typically accesses the desired coded video residing on the content 
distribution network 240 (Figure 2) or the video server 320 at state 516. The video 
player window includes an embedded video player, which may be a module such as the 
Real or Windows or QuickTime player, placed within the window 356. The window 
356 may have navigation, advertising, and so forth presented in the context of the 
customer site (portal), while the video player 360 is the technology component for 
decoding the coded video and displaying it. The user is then free to employ the 
standard video playback controls found in player windows (play, pause, stop, fast 
forward, rewind, etc.) at state 518 to navigate and view the decoded video. 

When the user has completed navigating and viewing the video at state 518, 
process 500 advances to a decision state 520 to determine if the user desires to select 
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other query results for playback. If so, process 500 moves back to state 512 where the 
user selects another one of the query results. However, if the user does not desire to 
select other query results for playback as determined at decision state 520, process 500 
continues at a decision state 522 to determine if the user desires to express another 
query. If so, process 500 moves back to state 508 to enter and submit a different query. 
However, if the user does not desire to express another query as determined at decision 
state 522, process 500 completes at end state 524. 

Figure 6 is a flowchart showing the overall process 600 of video spidering and 
video index maintenance, and gives an overview of the collection of subprocesses 
corresponding to the modules previously shown in Figure 4 (these subprocesses will be 
identified with the reference number of a corresponding module). Many of these 
processes operate concurrently. A detailed description of each subprocess is provided in 
conjunction with Figures 7 through 1 1. A Control process corresponding to the Control 
module 410 (Figure 4) manages sets of subprocesses and uses standard techniques for 
monitoring and balancing compute resources for these subprocesses. One set of 
subprocesses is Spidering 422/424, Uniqueness Checking 426, and Grouping 428 which 
collectively identify video URLs for harvesting. The Spidering process 422/426 
traverses web pages and gathers candidate URLs that are passed to the Uniqueness 
Check 426 to avoid re-indexing URLs that are already known. Unique URLs are then 
passed to the Grouping process 428 through a video URL page cache 610 to identify 
like content that exists in various bit rates and formats. 

The Grouping process 428 results in final URLs placed in a video URL harvest 
queue 620. The Control process 410 also maintains the Harvesting process 460 for 
actually processing and indexing the video identified by the final URLs. In one 
embodiment, the resulting metadata is stored in a searchable video index 630 and the 
video URLs are stored in a known video URLs database 640. In one embodiment, the 
searchable video index 630 corresponds to the IVG video index 334 (Figure 3). The 
Control process 410 also operates the periodic (such as, running once every day or 
week) Maintenance process 440 to check link existence, integrity, and potential 
modifications (new content found at a previously known URL). 
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Referring to Figure 7 and also to Figure 6, the main video spidering process 
422/424, previously shown in Figure 6, will now be described. The spidering process 
422/424 begins at state 702 with the starting point URLs being provided by an input 
queue mechanism (coming from the administrative interface 412 (Figure 4) of the 
5 Control process 410 in one embodiment). For each starting point URL, the spider 
process 422/424 visits the page and parses the HTML to recursively traverses links on 
that site at state 704. At each leaf of the recursion tree, the process 422/424 parses the 
HTML to identify video content at state 706. Process 422/424 may find basic (HREF) 
video URLs based on the MIME-type of the link found at state 708. These are the 

10 simple forms of video that can be found and directly processed further, i.e., process 

422/424 proceeds to pass the candidate URL to the Uniqueness Check process 426 
described in conjunction with Figure 8. Or else, process 422/424 may find scripting 
blocks at state 710. The scripting blocks are parsed and executed at state 712 in order to 
evaluate the deeper mechanism for access to the video content. In widespread practice 

15 today are two possible forms: scripts that evaluate to a simple video URL at state 714 
(in which case the system proceeds to pass the candidate URL to the Uniqueness Check 
process 426), or "container" URLs at state 716. Container URLs can accomplish many 
things, such as assembling a play list of video segments that are presented to the user. 
Clip segments might be 'bumpers' (i.e., logos, intros, etc.), or advertising. Advertising, 

20 in particular, should not be indexed as part of the process, and can be easily identified in 
practice because of its origin (e.g., such as from an ad broker like DoubleClick or 
Engage). Therefore, if a container URL is found at state 716, it is parsed at state 718 to 
identify the actual content segment(s) that need to be indexed. Once such an URL is 
parsed out of the container play list, process 422/424 proceeds to state 720 to pass the 

25 candidate URL to the Uniqueness Check process 426 (Figure 8). Concurrently, in one 
embodiment, the spidering process 422/424 moves back to state 702 to get the next 
URL as described above. 

Referring to Figure 8 and also to Figure 6, the video content uniqueness 
checking process 426, previously shown in Figure 6, will now be described. The 

30 process 426 begins at state 802 by accepting a candidate video URL from the spidering 

process 422/424 (Figure 7). Moving to state 804, process 426 performs a database look- 
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up of the candidate URL against the repository of known URLs in database 640. If the 
URL is found to be unique at state 806, it is passed into the URL page cache 610 to be 
processed by the Grouping process 428. Else, at state 808, if the URL is determined to 
be already known, the content is checked for any updates or changes since the last time 
it was indexed. The process 426 checks the date, byte count, and duration (or any 
combination of those criteria or other designated criteria) of the video to see if any of 
these properties have changed. If so, the URL requires re-indexing and process 426 
advances to state 810 where the URL is passed to the URL page cache 610 for further 
processing. At the completion of state 806 or state 810, process 426 ends and transfers 
execution to the grouping process 428 (Figure 9). 

Referring to Figure 9 and also to Figure 6, the video grouping process 428, 
previously shown in Figure 6, will now be described. The process 428 begins at state 
902 by retrieving a next video URL from the page cache 610. Advancing to state 904, 
process 428 applies a set of proximity criteria to generate candidates for grouping. 
URLs are considered proximate if they are physically close together within the HTML 
or scripting blocks. Those candidates identified at state 904 are then passed to a root 
name checking state 906 that looks for common strings in the root of the URL. Most 
often, like content that differs only in format or bitrate will have the same basic 
identifier, with only the final filename or suffix indicating a difference. For example, a 
video on a Real Server might be referenced as "rtsp://server_name/content/news.rm" 
while the same video in Microsoft format would be 
"rtsp://server_name/content/news.asf '. As another example, bit rate differences are 
typically indicated with suffix changes to the filename, such as "news_56k.rm" and 
"news_300k.rm" representing two different bit rates of Real Video (56kbps and 
300kbps, respectively). Common URL root names and differing suffixes are fairly easy 
to parse and identify using string comparisons. If, at state 908, no grouping is found, 
the individual page URLs are passed at state 914 to the final harvest queue 620 for 
processing by the harvest process 460 described in conjunction with Figure 10. If, at 
state 910, a group of like URLs is found, the process 428 proceeds to a selection criteria 
state 912 to select the one, best URL for indexing purposes. The selection criteria is 
designed to balance the requirements of having high quality video signals (not degraded 
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by too much compression), and the bandwidth and computational costs of processing 
and indexing video. These criteria can easily change over time with changes in the cost 
structures for bandwidth and computation, and with changes in available video formats 
(e.g., some formats might be preferable over others at a given bit rate). An example 
5 criteria in use today is selecting the highest bit rate, and to prefer Real Video over other 
formats. Once the selection is made at state 912, the final URL is placed in the Harvest 
Queue 620 at state 914. At the completion of state 914, process 428 ends at a done state 
916. In one embodiment, process 428 is a portion of the spidering process 420, which is 
load-balanced, and thus, processes 422, 424, 426, and 428 occur sequentially. In 
10 another embodiment, some or all of processes 422, 424, 426, and 428 can be separated 
into independent load-balanced processes with corresponding enqueue/dequeue 
mechanisms. 

Referring to Figure 10 and also to Figure 6, the video logging process 450 and 
video harvesting process 460, previously shown in Figure 6, will now be described 

15 collectively as process 460'. The process 460' begins at state 1002 by retrieving the 

next video URL from the harvest queue 620. The harvest process 460' has at least one, 
and typically several, video logging resources 450 under its control. The video URL is 
submitted to the next available video logging resource 450 at state 1004 to ingest the 
video and generate a metadata index of the content. Proceeding to state 1006, the video 

20 logging resource 450 logs the video and generates a batch of time-stamped metadata. 

Continuing at state 1008, an optional human process can occur whereby the human 
provides additional annotations, category selections, or create titles for the content. 
Proceeding to state 1010, process 460' inserts the resulting video index (metadata) into 
the production video index 630 in the Guide's application server and is then available 

25 for search and retrieval operations. Advancing to state 1012, the corresponding 
"known" video URL is also inserted into the optimized database of known video URLs 
640 which is used by the Uniqueness Checking process 426. Process 460' repeats as 
long as there are additional harvest URLs provided by the enqueue/dequeue 
management mechanism 430. At the completion of state 1012 and if there are no 

30 further harvest URLs to process, process 460' ends at a done state 1014. 
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Referring to Figure 1 1 and also to Figure 6, the video index maintenance process 
440, previously shown in Figure 6, will now be described. The process 440 typically 
runs periodically under the control of the Control Process 410. When the maintenance 
process 440 is invoked at start state 1102, it starts processing URLs from the known 
5 video URL database 640 at state 1104. For each URL, process 440 advances to state 
1106 and performs an existence check to see if the URL is still present on the page it 
was originally found. If the URL is not found at state 1 106, process 440 moves to state 
1108 and removes the URL from the known video URLs database 640 and the video 
index 630 of the Guide's server is updated to remove the URL. If the URL is found at 

10 state 1106, this implies the video is still 'published' on the page, and the process 440 
proceeds to an integrity check at state 1110. The integrity check actually follows the 
link to verify that the video can be accessed and that the link is not a 'dead link' 
(resulting in an HTTP error 404: "Link not found" message, for example). If the link is 
deemed a dead link by state 1 1 10, it is removed from the system at state 1 108 as above. 

15 If the link is still valid as determined at state 1110, the process 440 proceeds to a 
modification check at state 1 1 12 to see if the content has been updated or changed since 
it was last indexed. The modification check is similar to the properties check 808 that 
occurs during the Uniqueness Check process 426 (Figure 8). Process 440 checks the 
date, byte count, and duration (or any combination of those criteria or other designated 

20 criteria) of the video to see if any of these properties have changed. If so, the URL 

requires re-indexing and process 440 advances to state 1114 where the URL is passed to 
the video harvest queue 620 for harvesting. If the URL does not require re-indexing as 
determined at state 1112, the process 440 loops back to state 1 104 to begin work on the 
next URL from the known video URLs database 640. If there are no further URLs in 

25 the known video URLs database 640, process 440 ends at a done state 1116. 

As described herein, embodiments of the invention fill the longstanding need in 
the technology for a system whereby a website or web portal can access centralized 
video index information derived by a logging process from random content owners, 
Guide affiliates, proactively gathered public domain content, and proactively harvested 

30 video content from a network (e.g., the Internet) via a video spidering mechanism. The 
video index information can be collected and maintained in a hosted, centralized 
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repository and made available via an application interface, which can be customized, to 
users of the network. 

While the above detailed description has shown, described, and pointed out 
novel features of the invention as applied to various embodiments, it will be understood 
that various omissions, substitutions, and changes in the form and details of the device 
or process illustrated may be made by those skilled in the art without departing from the 
intent of the invention. The scope of the invention is indicated by the appended claims 
rather than by the foregoing description. All changes which come within the meaning 
and range of equivalency of the claims are to be embraced within their scope. 
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